Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
También | 859 | 42 | 1 | 42.0000 |
Y | 2102 | 76 | 3 | 25.3333 |
Las | 2036 | 184 | 8 | 23.0000 |
Estas | 332 | 23 | 1 | 23.0000 |
La | 8205 | 661 | 29 | 22.7931 |
Al | 785 | 45 | 2 | 22.5000 |
Los | 3536 | 292 | 13 | 22.4615 |
El | 9144 | 664 | 32 | 20.7500 |
Este | 1204 | 74 | 4 | 18.5000 |
Aunque | 340 | 18 | 1 | 18.0000 |
Durante | 373 | 17 | 1 | 17.0000 |
Es | 1895 | 68 | 4 | 17.0000 |
Esta | 1095 | 81 | 5 | 16.2000 |
Estos | 401 | 32 | 2 | 16.0000 |
close | 260 | 16 | 1 | 16.0000 |
No | 2287 | 90 | 6 | 15.0000 |
Entre | 305 | 15 | 1 | 15.0000 |
En | 6062 | 188 | 13 | 14.4615 |
Se | 1814 | 115 | 8 | 14.3750 |
pero | 3480 | 100 | 7 | 14.2857 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
tipos | 238 | 1 | 19 | 0.0526 |
objeto | 238 | 1 | 14 | 0.0714 |
vamos | 223 | 1 | 14 | 0.0714 |
acerca | 502 | 2 | 23 | 0.0870 |
millones | 920 | 5 | 57 | 0.0877 |
creo | 300 | 1 | 11 | 0.0909 |
capaces | 129 | 1 | 11 | 0.0909 |
Dec | 66 | 1 | 10 | 0.1000 |
Jan | 77 | 1 | 10 | 0.1000 |
dentro | 740 | 2 | 19 | 0.1053 |
capaz | 159 | 1 | 9 | 0.1111 |
2015 | 244 | 1 | 9 | 0.1111 |
miles | 232 | 1 | 9 | 0.1111 |
ninguno | 90 | 1 | 8 | 0.1250 |
joven | 293 | 2 | 16 | 0.1250 |
iba | 170 | 1 | 7 | 0.1429 |
cerca | 357 | 2 | 14 | 0.1429 |
tasas | 81 | 1 | 7 | 0.1429 |
reino | 78 | 1 | 7 | 0.1429 |
boletín | 35 | 1 | 7 | 0.1429 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II